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PSEUDO-MAXIMUM LIKELIHOOD ESTIMATION 
OF ARCH(oo) MODELS 

By Peter M. Robinson 1 and Paolo Zaffaroni 

London School of Economics and Imperial College London 

Strong consistency and asymptotic normality of the Gaussian 
pseudo- maximum likelihood estimate of the parameters in a wide 
class of ARCH(oo) processes are established. The conditions are shown 
to hold in case of exponential and hyperbolic decay in the ARCH 
weights, though in the latter case a faster decay rate is required for 
the central limit theorem than for the law of large numbers. Partic- 
ular parameterizations are discussed. 

1. Introduction. ARCH(oo) processes comprise a wide class of mod- 
els for conditional heteroscedasticity in time series. Consider, for t G Z = 
{0, ±1, . . .}, the equations 

(1) x t = o- t e u 

oo 

(2) CT t 2 = w + yj^°i x *-i' 
where 

oo 

(3) uj >0, ipoj>® (j>l), X^°i <0 °' 

i=i 

and {et} is a sequence of independent identically distributed (i.i.d.) unob- 
servable real- valued random variables. We shall assume that a strictly sta- 
tionary solution xt to (1) and (2) exists almost surely (a.s.) under (3), and 
call it an ARCH(oo) process. We consider a parametric version, in which we 
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know functions ijjjiC) °f the r x 1 vector for r < 00, such that, for some 
unknown £0, 

(4) ^i(Co) = Vt>j, i>i- 

Also, wo is unknown and xt is unobservable but we observe 

(5) Vt = fM) + x t 

for some unknown hq. 

ARCH(oo) processes, extending the ARCH(m), m < 00, process of En- 
gle [11] and the GARCH(n,m) process of Bollerslev [4], were considered by 
Robinson [29] as a class of parametric alternatives in testing for serial inde- 
pendence of yt- Empirical evidence of Whistler [35] and Ding, Granger and 
Engle [10] has suggested the possibility of long memory autocorrelation in 
the squares of financial data. Taking [contrary to the first requirement in 
(3)] loq = 0, such long memory in x\ driven by (1) and (2) was considered 
by Robinson [29], the ipoj being the autoregressive weights of a fractionally 
integrated process, implying X^iV'Oj = 1; see a l so Ding and Granger [9]. 
For such i^oj, and the same objective function as was employed to generate 
the tests of Robinson [29], Koulikov [20] established asymptotic statistical 
properties of estimates of Co- On the other hand, under our assumption 
too > 0, Giraitis, Kokoszka and Leipus [13] found that such ipoj are incon- 
sistent with covariance stationarity of xt, which holds when X^Li^Oj < 1- 
Finite variance of xt implies summability of coefficients of a linear moving 
average in martingale differences representation of x\\ see [37]. In this pa- 
per we do not assume finite variance of Xt, but rather that xt has a finite 
fractional moment of degree less than 2. The first requirement in (3) was 
shown by Kazakevicius and Leipus [18] to be necessary for existence of an 
xt satisfying (1) and (2). The intermediate requirement in (3) is sufficient 
but not necessary for a.s. positivity of of, and is imposed here to facilitate 
a clearer focus on the ifjQj, which decay, possibly slowly, but never vanish. 

We wish to estimate the (r + 2) x 1 vector 6q = (u;o, /-*0j Co)' on the basis 
of observations yt, t = 1, . .. ,T, the prime denoting transposition. The case 
when fiQ is known, for example, fiQ = 0, is covered by a simplified version 
of our treatment. If the yt were instead unobserved regression errors, we 
have /^o = 0, but would then need to replace xt by residuals in what follows; 
the details of this extension would be relatively straightforward. Another 
relatively straightforward extension would cover simultaneous estimation of 
the regression parameters ujq and Co> after replacing ^0 by a more general 
parametric function; as in (1), (2) and (5), efficiency gain is afforded by 
simultaneous estimation. 

Under stronger restrictions than Y^jLi V'Oj < 1> Giraitis and Robinson [14] 
considered discrete- frequency Whittle estimation of Co > based on the squared 
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observations y| (with known to be zero) , this being asymptotically equiva- 
lent to constrained least squares regression of yf on the J/|_ s , s > 0, a method 
employed in special cases of (2) by Engle [11] and Bollerslev [4]. In these 
the spectral density of y|, when it exists, has a convenient closed form. 
This property, along with availability of the fast Fourier transform, makes 
discrete-frequency Whittle estimation based on the y\ a computationally at- 
tractive option for point estimation, even in very long financial time series. 
However, it has a number of disadvantages, as discussed by Giraitis and 
Robinson [14]: it is not only asymptotically inefficient under Gaussian £t, 
but never asymptotically efficient; it requires finiteness of fourth moments 
of yt for consistency and of eighth moments for asymptotic normality, which 
are sometimes considered unacceptable for financial data; its limit covari- 
ance matrix is relatively complicated to estimate; it is less well motivated 
in ARCH models than in stochastic volatility and nonlinear moving average 
models, such as those of Taylor [33], Robinson and Zaffaroni [30, 31], Harvey 
[15], Breidt, Crato and de Lima [5] and Zaffaroni [36], where the actual likeli- 
hood is computationally relatively intractable, while Whittle estimation also 
plays a less special role in the short-memory-in-yf ARCH models of Giraitis 
and Robinson [14] than in the long-memory-in-y^ models of the previous 
five references, where it entails automatic "compensation" for possible lack 
of square-integrability of the spectrum of y\ . Mikosch and Straumann [26] 
have shown that a finite fourth moment is necessary for consistency of Whit- 
tle estimates, and that convergence rates are slowed by fat tails in St- 

For Gaussian St , a widely-used approximate maximum likelihood estimate 
is defined as follows. Denote by 6 = (u>, fj,, £')' any admissible value of 8q an d 
define 

xtilA = yt -(*■, 



for i € Z, and 



t-i 



a t 2 (fl) = ^ + (C)*L>)l(t > 2) 

3=1 

for t > 1, where l(-) denotes the indicator function. Define also 

9m = 4^ + WW, m = ?M + ww, i < 



T T 

Q T (6) = T~ x J2 ft W. QtW) = T" 1 E 9t(B), 

t=i t=i 

6t = ar g mm Qt(0), 9t = argminQy^)) 
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where is a prescribed compact subset of M. r+ . The quantities with over-bar 
are introduced due to yt being unobservable for t<O;0T is unconfutable. 
Because we do not assume Gaussianity in the asymptotic theory, we refer 
to Qt as a pseudo-maximum likelihood estimate (PMLE). 

We establish strong consistency of 9t and asymptotic normality of T 1 / 2 (6t — 
6>o), as T — > oo, for a class of ^j(C) sequences. In the case of the first prop- 
erty this is accomplished by first showing strong consistency of Oj- and then 
that 9t — Bt 0, a.s. In the case of the second we likewise first show it 
for T x l 2 (dr ~ ) and then show that 8 T - 6 T = o p (T~ 1 / 2 ), but the latter 
property, and thus the asymptotic normality of T 1 I 2 (Qt — #o)> is achieved 
only under a restricted set of possible Co values, and this seems of practi- 
cal concern in relation to some popular choices of the ^j(C)- These results 
are presented in the following section, along with a description of regularity 
conditions and partial proof details. The structure of the proof is similar in 
several respects to earlier ones for the GARCH case of (2), especially that 
of Berkes, Horvath and Kokoszka [3]. Sections 3 and 4 apply the results to 
particular models. 

2. Assumptions and main results. Our assumptions are as follows. 

Assumption A(q), q>2. The Et are i.i.d. random variables with Ee® = 
0, Eeq = 1, -E|eo| 9 < 00 an d probability density function /(e) satisfying 

f(e) = 0(L(\e\- 1 )\e\ b ) ase^O, 

for b > — 1 and a function L that is slowly varying at the origin. 

Assumption B . There exist u>l,u>U: such that < ojl < <^u < 00 j 

— oo < ji^ < fiu < oo, and a compact set T G R r such that = [lol,uju] x 
[fiL^u] x T - 

Assumption C. 8 is an interior point of G. 
Assumption D. For all j > 1, 



for some d > 0; 
for l<k<j, 

where K throughout denotes a generic, positive constant. 



(6) inf^(C)>0; 

(7) sup^-(C) < Kj-4- 

(8) Vo, < Ktp ok 
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Assumption E. There exists a strictly stationary and ergodic solution 
x t to (1) and (2), and for some 

(9) P e((d + i)-\i), 

with d as in Assumption D, we have 

(10) £|x | 2p <oo. 

Assumption F(Z). For all j > 1, Vj(C) nas continuous kth derivative 
on T such that, with Q denoting the ith element of 



(11) 



d(h ■■■d( i 



1-77 



for all 7] > and all = 1, . . . , r, h = 1, . . . , k, k <l. 

Assumption G. For each £ G T, there exist integers ji = i = 

1, . . . , r, such that 1 < < ■ ■ ■ < j r (C) < °o and 

rank{* 0l) ... )ir) (C)} = r, 

where 

*a w (C) = {«i I, (C),...,«i"K)}, *5"(o = ^P- 

Assumption H. There exists 

(12) d > \ 
such that 

(13) ^<Kr l - d \ 

and (10) holds for 

(14) pe(4/(2do+3),l). 

Assumption A(q) allows some asymmetry in £t, but implies the less prim- 
itive condition (which does not even require existence of a density) employed 
in a similar context by Berkes, Horvath and Kokoszka [3]. Assumptions B 
and C are standard. Inequalities (7) and (13) together imply do > d, while 
(8) with (3) is milder than monotonicity but implies i^oj = °{j ) as 3 ~^ °°- 
We take r/ > in Assumption F(l) because ipj{Q < 1 for all large enough j, 
by (7). Assumption G is crucial to the proof of consistency, being used in 
Lemmas 9 and 10 to show that in the limit #o globally minimizes Qt(0); it 
also ensures nonsingularity of the matrix Ho in Proposition 2 and Theorem 
2 below. This and other assumptions are discussed in Sections 3 and 4 in 
connection with some parameterizations of interest. 
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We present asymptotic results for the uncomputable 9t as propositions, 
those for 9t as theorems. All these, and the corollaries in Sections 3 and 4 
and lemmas in Section 5, assume (l)-(5). 

Proposition 1. For some 5 > 0, let Assumptions A (2 + 5), B, C, D, 
E, F(l) and G hold. Then 

6t — ► #o a - s - as T — > oo. 

Proof. The proof follows as in, for example, [17], Theorem 6, from 
uniform a.s. convergence over G of Qt(9) to Q{9) = Eqo(9) established in 
Lemma 7, the fact that Qt(9t) < Qt(9), and Lemma 10. □ 

Theorem 1. For some 5 > 0, let Assumptions A(2 + 5), B, C, D, E, 
F(l) and G hold. Then 

(15) 9t —> 6o a - s - asT^oo. 

Proof. From Lemmas 7 and 8, Qt(&) converges uniformly to Q(9) a.s., 
whence the proof is as indicated for Proposition 1. □ 

Denote by Kj the jth cumulant of et and introduce 

G = (2 + k a )M - 2k 3 (N + N') + P, H = M + \P, 

where 

M = £(t t ), N = E{a^r Q )e' 2 , P = %" 2 )e 2 e' 2 , 

for ro = ro(^o)) T t(6) = (d/d9) log erf (#), and e2 the second column of the 
(r + 2) x (r + 2) identity matrix. In case /j,q is known (e.g., to be zero), we 
omit the second row and column from M, and have instead Go = (2 + k^)M, 
Hq = M. In case et is Gaussian, k^ = = 0, so Go = 2ifo = 2M + P. 

Proposition 2. Let Assumptions A(4), B, C, D, E, F(3) and G hold. 
Then 

T l ' 2 (9 T - O ) A(0, H^GqHq 1 ) asT^oo. 
Proof. Write 

Q«(0 ) = ^ = T-iX>(0), 

t=l 

where 

u t {9) = T t (e){i- x t{e)) + a^{e)v t {9), 
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with 

By the mean value theorem, 

(16) = Q?(9t) = Q ( tH0o) + H T (9 T - ), 

where Ht has as its ith row the ith row of H T {9) = T" 1 Y%=i h(0) evaluated 
at = 6$, where h t (9) = (d 2 /d0dO')Q T (9), \\e^-9 \\ < ||0r-0o||, where we 
define \\A\\ = {tr(A' A)} 1 / 2 for any real matrix A. Now u t (9 ) = T t (9 )(l - 
e 2 ) — 2e2St/(Jt is, by Lemmas 2, 3 and 7, a stationary ergodic martingale 
difference vector with finite variance, so from Brown [6] and the Cramer- 
Wold device, T^ 2 Q^ (0 O ) -> d N(0, G )asT^oo. Finally, by Lemma 7 and 
Theorem 1, Ht — > p Hq, whence the proof is completed in standard fashion. 
□ 

Define 

*W = ^ m = M9)<(9), h t (9) = 3 ^ 

T T 

G T {9) = T~ l Y,9t(9), H T (9) = T'^htid). 
t=i t=i 

Theorem 2. Let Assumptions A(4), B, C, D, E, F(3), G and H hold. 
Then 

(17) T 1 / 2 0t-9 o )Sn(O,Hq 1 G o H o 1 ) asT^oo, 

and Hq 1 GqHq 1 is strongly consistently estimated by H^ 1 {9t)Gt{6t)H^ (6t) 

Proof. We have 

= Q^(9 T ) = qP(0 o ) + H T (9 T - ), 

where Q$\o) = (d/d9)Q T (9) and H T has as its ith row the ith row of Ht(0) 
evaluated at = 9$, for \\0$ - 9 \\ < \\9 T - B \\. Thus, from (16), 

9 T - T = (Hf 1 - H^)Q^(9 ) - H^{Q { ?{9 Q ) - Q?(9 )}, 

where the inverses exist a.s. for all sufficiently large T by Lemma 9. In view 
of Proposition 2 and Lemma 8, (17) follows on showing that 

Q?(9 )-Q?(9 ) = o p (T- 1 / 2 ). 
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The left-hand side can be written (-Bit + B 2 t + B%r)/T, where 

T T T 

B 1T = ^2ejbu, B 2T = -J2(e 2 - l)b 2t , B 3T = -2e 2 ^2e t b 3t , 
t=i t=i t=i 

with 

-2(1)/ 9 -2\ 2(1) -2(1) -2 
"It — —a , 2 t — =T~ ' °3t — —=2 ' 

af af af a fa t 

for a? = a?(flb), a? 1 ) = ^(0 O ), ^« = a?> ), with 
(0/00)of (0), cr 2 t (l) {6) = (8/86)^(8). We show that B iT = OpiT 1 / 2 ), i = 1,2 
For the remainder of this proof, we drop the zero subscript in ifjQj. 
Consider first B\t- We have 



(is) o? w = fi,-2|:^-i,i:^ i) ^ , 



where ij/p = ip^iCo)- Prom Assumption F(l), 

\\a 2 t W\\ < 1 + 2^^1x^1 + K E^-^ti, 
j'=i j'=i 

for all 77 > 0. Now 

t-i /t-i \ 1/2 / 00 \ 1/2 

j=l Vi=l / M=i / 

so since at > ojl > 0, 

t-i 

<j f ~ 2 V] ^j|xt-j| < Ka^ 1 < 00. 



From (8), 



3=1 



t-l 



It follows that 

(19) ||a 2(1) ||M 2 <^. 

On the other hand, by the cv-inequality ([23], page 157) and (10), 

00 00 

(20) E(a 2 - a 2 )" < #X>^k-i| 2 ' < #£Vf 
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Thus, by (8) and (14), 

oo oo 

(21) E\\b lt \\ p < K^p^tf < K^f^ < ^ 1 -^*+ 1 )( 1 -") J 

j=t j=t 

choosing r\ < 1 — I / {p(do + 1)} , which (14) enables. Applying the c r -inequality 
again, 

T 



E\\B 1T \\P<KY^E\eo\ 2p E\\b 



if 



Applying (21), this is O(l) when p > 2/ (do + 1), while when p < 2/ (do + 1), 
we may choose r/ so small to bound it by 

KT 2-p(d +l)(l- V ) < KT p/2-{l+2(d +l)(l-»))}[p/2-2/{l+2(d +l)(l- )? )}] = Q f T p/2\ 

using (12) [which requires (13)] and arbitrariness of 77. Thus, B 1T = o p (T 1 / 2 ) 
by Markov's inequality. 

Consider -E>2T- By independence of £j and bit, by the c r -inequality when 
p<\i and by the inequality of von Bahr and Esseen [34] and the fact that 
the e\ are i.i.d. with mean 1 when p > i, 

T T 

£||i?2T|| 2p < A]T(£| eo | 4p + l)£||&2t f < K^iEWhtW 2 ' + ^IIM 2 "), 
t=l t=l 

where 



a 



2(1) 2(1) 
— Of 



^4t 



07 



55t 



^ (1) K 2 -^ 2 ) 



Thus, from Assumptions F(l) and H, 



HM< hE^-K-ii+EH^ 



07 



<^f" 



( 00 

x 1/2 

f 00 

2 + £ 



f»ii 2 M)^ 



1/2-, 



00 \ !/ 2 



E^. 



1/2 



00 \ 1/2- 

l-2« 2 



'X 



< A" 



-d /2 



+ Ei' 



i +l)(l-2r,) 2 



1/2-, 



kJ =f 



so 
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for sufficiently small rj. Thus, Yl=i E \\ht \\ 2p is 0(1) for p > 2/ (do + 1), while 
for p < 2 /(do + 1), it is bounded by 

KT 2-(d +l)p{l-2ri) < KT p-(d +2){p-2/(d +2)}+2(d +l)pri _ (^P) 

from (14) and arbitrariness of rj. Also, ||&5t|| < K\\af^ /a 2 ||(cr| — of) 1 / 2 , so 
from (19) and (20) we have £||&5i|| 2p < Kt 1 "^ 1 ^ 1 - 2 ^ , and proceeding as 
before, 

f>||M 2p = o(T"), 

i=l 

and thence, Bit = o p (T 1//2 ). 
Next, 



E\\B 3T \\ 2p < KE 



2p 



applying the c r -inequality when p < ^ and von Bahr and Esseen [34] when 
p>\. Now b 3t < (of - a 2 ) 1 / 2 ^" 2 , so from (20), 

T oo 

E\\B 3T \\ 2 »<KJ2Y,^ 
t=i j=t 

< AT{3L(p > 2/(d + 1)) + (lnT)l(p = 2/ (do + 1)) 
+ r 2 -^' +1 )l(p<2/(d + l))} 

= 0(3*), 

much as before. Thence, -B3T = o p (T 1//2 ). 

It remains to consider the last statement of the theorem, which follows 
on standard application of Propositions 1 and 2, Theorem 1 and Lemmas 7 
and 8. □ 

In earlier versions of this paper we checked the conditions in the case of 
GARCH(n,m) models in which the V'j(C) decay exponentially and we allow 
the possibility that the GARCH coefficients lie in a subspace of dimension 
less than m + n; the details are available from the authors on request. How- 
ever, the literature on asymptotic theory for estimates of GARCH models 
is now extensive, recent references including [3, 7, 12, 16, 22, 32], along 
with investigations of the properties of the models themselves; see recently 
[2, 18, 25]. We focus instead on alternative models which have received less 
attention, and for which our theoretical framework is primarily intended. 

We introduce the generating function 

00 

(22) #z;C) = X> 3 W, N<i. 

3=1 
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3. Fractional GARCH models. A slowly decaying class of ARCH(oo) 
weights was considered by Robinson [29] , Ding and Granger [9] and Koulikov 
[20], generated by 

(23) v^;C) = i-(i-^) c , o<c<i, 

where r = 1 and formally 

(24) (i -*-fiiwV' N ^ d>a 

In these references u>o = was assumed in (2), but we assume ujq > and 
generalize (23) as follows. Introduce the functions a,j = aj(Q, bj = bj(C) and, 
for m > 1, n > 0, n + m>r, 

m n 

(25) a(z;C) = ]>>^' b & = 1 - £ > 1); 

i=i j=i 

and for all £ G T, 

(26) aj>0, j' = l,...,m; 6j > 0, j = l,...,n; 

(27) 6(z;C)/0, |*|<1; 

(28) a(z; £) and 6(z; () have no common zeros in z. 
Now take V>( z iC) (22) to be given by 

with a! = d(C) satisfying 

(30) d£(0,l). 

We call xt based on (29) a fractional GARCH, FGARCH(n, do, m) process, 
for d = d(Co). 

Corollary 1. Let ijj(z;Q be given by (29) and (25) with m> 1, n> 0, 
and let d and the aj,bj be continuously differentiable. For some 5 > 0, let 
Assumptions A(2 + <5) ; B, C and E hold, with all ( G T satisfying (26)-(28) ; 
(30) and 

rank jj|r(ai, • • • , a m , h, . . . , b n , d) j = r. 

TTien (15) is true. Let a/so d and t/ie aj,6j 6e thrice continuously differen- 
tiable and do > \ . Then (17) is true. 
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Proof. Denoting by Cj (j > 1) and dj (j > 0) the coefficients of z 3 in 
the expansions of a(z; Q/b(z; (), z~ l {l — (1 — z) d }, respectively, we have 
i/jj(C) = Z)fc=o c j-kdk, j > 1- From [3], the Cj are bounded above and be- 
low by positive, exponentially decaying sequences when n > 1, and are 
all nonnegative when n = 0. Since the dj are all positive, it follows that 
(6) holds. Also, Stirling's approximation indicates that j~ d ~ l /K < dj < 
Kj~ d ~ l , so the ipj(C) satisfy the same inequalities. Compactness of T, 
smoothness of d, and (30), imply d(Q > d, to check (7). The above argument 
indicates that ip j < Kj^ " 1 < Kk~ d °~ 1 < K^ Qk for j > k > 1, so (8) holds, 
and thus Assumption D. With regard to (11), note that {d/dd)ip{z;( t ) = 
— {a(z'X)/b(z;C)} z ~ 1 (^~ z) d ln(l — z), where the coefficient of z J in — — 
z) d ln(l - z) is Y? k =i k ~ ld i-k < KQnj)^- 1 < Kj-^+W-v) < K^'iC) 
for any rj > 0. Derivatives with respect to the a,j,bj are dominated, and 
higher derivatives can be dealt with similarly, to complete the checking of 
Assumption F(Z). To check Assumption G, suppress reference to C in o,, b, 
tjj and 

4>(z) = b{z)- x {\ - (1 - z) d }, 7 (z) = b(zr l a(z), 



and note that 



djjz) 
daj 

dtp(z) 
~db~ 



,m, 



z J ^{z)<f){z), j = l,...,n, 



mf=-^(i- z rio g (i-z). 

ad z 

Choose ji(C) = i for i = 1, . . . , m + n, ( G T, leaving j m +n+i(C) to be deter- 
mined subsequently. Fix £ and write U = ^ , (j i ,...,j r )(0) partitioning it in the 
ratio m + n : 1 and calling its (i, j)th submatrix Uij. We first show that the 
(m + n) x (m + n) matrix U\\ is nonsingular. Write R for the n x (m + re) 
matrix with (i,j)th element Jj—i, and S for the (m + n) x (m + n) matrix 
with (i, j)th element </>j_j+i, where 0j = 7j = for j < 0, and for j > 0, (j)j 
and 7j are respectively given by 

oo oo 
3=1 3=1 

these series converging absolutely for \z\ < 1 in view of (30). Noting that 

ipj is given by (d / 'd()ip(z) = Yl'jLi ipj^z-i, we find that the first m rows of 
Un can be written (I m ,0)S, where I m is the m-rowed identity matrix, O 
is the m x n matrix of zeroes and, when n > 1 the last n rows of U\\ can 
be written RS. Now S 1 is upper-triangular with nonzero diagonal elements. 
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Thus, for n = 0, U\\ = S is nonsingular. For n > 1, C/n is nonsingular if and 
only if the n x n matrix i?2 having (i,j)th element 7 m +j_j and consisting 
of the last re columns of R is nonsingular. This is not so if and only if the 
jj, j = m, . . . ,m + n — 1, are generated by a homogeneous linear difference 
equation of degree n — 1, that is, if there exist scalars Ao, Ai, . . . , A n _i, not 
all zero, such that 



n-l 



A o7j - X! ^*7j-t = °> j = m,...,m + n-l. 

i=i 

But it follows from (25) and (27) that they are generated by the linear 
difference equation 

n-l 

7j - 51 & i7j-i = n 3> j = m,...,m + n-l, 
i=l 

where 7r m = a m + 6 n 7 m _ n , 7Tj = b n jj- n for j = m + 1, . . . , to + re - 1. Since 
6 n 7^ 0, the ttj are all zero if and only if -y m -n = —o- m /b n and 7j = for 
j = m + l — n, ...,m — 1. But this implies 7 m = also, and thence, 7j = 0, 
all j > m — n + 1. For m < n, this is inconsistent with the requirement a,- > 0, 
j = 1, . . . , to, and for to > re, it implies a has a factor b, which is inconsistent 
with (28). Thus, U\\ is nonsingular when n > 1. Nonsingularity of £/ follows 
if U 22 + U 21 U^U 12 . For large enough j m + n _|_i = j m + n _|_i (^) , this must be 
true because U22 decays like (lnjV»4^+i)jro+n+i' whereas the elements of 
?7i2 are 0(/3 Jm + n+1 ) for some /? G (0,1). Thus Assumption G is true, and 
thence (15). Clearly (13) is true, so under the additional conditions so is 
Assumption H, and thence (17). □ 

For m = 1, re = 0, (29) reduces to (23) when a± = 1, while when a± G (0, 1), 
it gives model (4.24) of Ding and Granger [9]. The important difference be- 
tween these two cases is that the covariance stationarity condition ip{l] Co) < 
1 is satisfied in the second but not in the first. In general with (29), as with 
the GARCH model, x% is covariance stationary when a(l;Co) < Co) but 
not otherwise. We compare (29) with 

(31) vfeo=i- {1 :; ( !' c)} (i-^. 

b{z;Q 

with d again satisfying (30) and a and b again given as in (25), though we 
now allow to = 0, meaning a(z; £) = 0. Thus, with to = re = 0, (31) reduces 
to (23). ARCH(cx)) models with ip given by (31) were proposed by Bail- 
lie, Bollerslev and Mikkelsen [1] and called FIGARCH(re, do, to). In general, 
though (31) also gives hyperbolically decaying ipoj, it differs in some no- 
table respects. Application of (26)-(28) again ensures positivity of ^'(C) in 
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case of FGARCH and facilitates the above proof, but sufficient conditions 
in FIGARCH are less apparent in general, though Baillie, Bollerslev and 
Mikkclsen [1] indicated that they can be obtained. Also, unlike FGARCH, 
FIGARCH xt never has finite variance. 

The requirement do > ^ for the central limit theorem in Corollary 1 would 
also be imposed in a corresponding result for FIGARCH. This is automati- 
cally satisfied in GARCH models but if only do G (0, |] in (13) is possible in 

the general setting of Section 3, it appears that the asymptotic bias in 9t is 
of order at least T -1 / 2 , whereas that for 9<p is always o(T -1 / 2 ). Assumption 
H copes with the replacement of of (0) by <r 2 (#), the truncation error vary- 
ing inversely with do- Inspection of the proof of Theorem 2 indicates that 
this bias problem is due to the term H^Bit- The factor of — of in bit is 
nonnegative, and if j 1 is an exact rate for ipoj, o 2 — of exceeds t~ d °/K 

as t — ► oo with probability approaching one. So far as the factor (tl /af in 
b\t is concerned, the second element of o 2 ^ 1 ) [see (18)] has zero mean, but 

the first is positive, and though the ip^ can have elements of either sign, 
whenever do < \ it seems unlikely that the last r elements of Bit can be 
o p (T 1 / 2 ). Nor is there scope for relaxing (12) by strengthening other condi- 
tions. With regard to implications for choice of p, when do > 2d + ^, (14) 
entails no restriction over (9). 

Though results of Giraitis, Kokoszka and Leipus [13] indicate existence of 
a stationary solution of (l)-(3) when ^>(l;£o) < 1> Kazakevicius and Leipus 
[19] have questioned the existence of strictly stationary FIGARCH processes, 
and thus the relevance of Assumption E here. The same reservations can 
be expressed about FGARCH when a(l;Co) > &(l;Co)j an d more generally 
about ARCH(oo) processes with > 1. A sufficient condition for (10) 

can be deduced as follows. Recursive substitution gives 

oo / oo oo \ 

i=i \ji=i j l= i / 

so by the c r -inequality, 

oo / oo oo 
1=1 Vji=l ji=l 



v If- \ 2 P \r \ 2 P 
* pt-.jl-.72 I ' ' ' l fc t-Ji Jl I 



Thus, from Lemma 2, 



E\x t \ 2p < E\o t \ 2p <K + KJ2[ E\e \ 2p £ . 



1=0 \ j=l 
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The last bound is finite if and only if 

oo 

(32) £| £0 | 2 'XX<1. 

i=i 

Thus, (10) holds if there is a p satisfying (9) and (32). Recursive substi- 
tution and the c r -inequality were also used by Nelson ([27], Corollary) to 
upper-bound E\at\ 2p in the GARCH(1,1) case, but he employed the simple 
dynamic structure available there, and (32) does not reduce to his necessary 
and sufficient condition. 

If ^(1; Co) < 1, (32) adds nothing because we already know that Exq < oo 
here, but if Co) > 1) the second factor on the left-hand side of (32) 
exceeds 1 and increases with p; the question is whether the first factor, 
which is less than 1 and decreases with p [due to Assumption A(q)], can 
over-compensate. Analytic verification of (32) for given Co> p seems in general 
infeasible, and numerical verification highly problematic when the tpj decay 
slowly. However, consider the family of densities 

(33) f(s) = exp[-{a(7)k|} 1 ^]/{2 7 r(7)a(7)} 

for 7 > 0, where 0(7) = {r (7) /T (37)} 1 / 2 (also used by Nelson [28] to model 
the innovation of the exponential GARCH model). We have Esq = 0, Ee\ = 
1 as necessary, Assumption A(q) is satisfied for all q > 0, and £ , |eo| 2p = 
r((2p + l)7)/{r(7) 1 -T(37)/'}. In case 7 = 0.5, (33) is the normal density, 
for which 9t is asymptotically efficient. Here E\eq\ 2p = 2 p T(p + §.h)/ yjlr, 
and numerical calculations for FIG ARCH (0, do, 0) cast doubt on (32). In 
case 7 = 1, (33) is the Laplace density, with E\e \ 2p = 2 p ~ 1 T(2p + 1). As 
7 increases, -E|eo| 2p can be made small for fixed p < 1, for example, with 
p = 0.95, it is 0.64 when 7 = 10 and 0.42 when 7 = 20. 

4. Generalized exponential and hyperbolic models. FGARCH(n, do,m) 
[and FIGARCH(n, do, m)] processes require do € (0,1). For d = 1, (29) re- 
duces to (23), and for d> 1, at least one coefficient in the expansion of (23) 
is negative, leading to the possibility of negative ipj(0- Because FGARCH 
fpj(C) decay like j~ d ~ 1 , a large mathematical gap is left relative to GARCH 
processes. Even if exponential decay is anticipated, there is a case for more 
direct modeling of the ipj(C) than provided by GARCH(n, m), since it is 
the tpj(C) and their derivatives that must be formed in point and interval 
estimation based on the PMLE. 

Consider the choices 

m 

(34) Vi(o = E r (^ + ir 1 ^ 1 /^-*', 

i=l 
m 

(35) Vi(C) = E r (Z* + ir^ln^j + 1)0' + I) - *" 1 , 

i=l 
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where d = d(() and the = ej(C), fi = fi(Q are such that T satisfies 

(36) d€(0,oo), 

(37) ej > 0, i = I, ... ,m, 

(38) </!<■•■< / m < oo, 

with 2m + 1 > r. Given (l)-(4) and (22), we call xt generated by (34) a 
generalized exponential, GEXP(m), process, and xt generated by (35) a 
generalized hyperbolic, GHYP(m), process. Condition (38) is sufficient but 
not necessary for ipj(C) > 0, all j > 1. By choosing m large enough in (34) or 
(35), any finite V(1;C) can be arbitrarily well approximated, but (34) and 
(35) can also achieve parsimony. For real x > 1, x^e~ dx and (Inxyx - ^ 1 
decay monotonically if / = 0, and for / > 0, have single maxima at f/d 
and ef/^ d+l \ respectively. Thus, with m = 1 and /i = 0, we have monotonic 
decay in (34) and (35); otherwise, both can exhibit lack of monotonicity, 
while eventually decaying exponentially or hyperbolically. The scale factors 
in (34) and (35) are so expressed because x^e~ dx and (lnx)^x~ d ~ 1 integrate 
over (0,oo) to r(/ + l)/d f+1 and T(f + l)/d, respectively, so that - 
Y^ILi &i in both cases, but the approximation may not be very close and the 
"integrated" case is less easy to distinguish than in GARCH and FGARCH 
models (though it would be possible to alternatively scale the weights by 
infinite sums to achieve equality). 

The following corollary covers (34) and (35) simultaneously, and implies 
the special case when the fi are specified a priori, for example, to be non- 
negative integers; strictly speaking, when the true value of f\ is unknown, 
Assumption C prevents it from being zero. 



Corollary 2. Let ip(z;Q be given by (22) and (34) or (35) with m>l 
and let d and the e% % ji be continuously differentiable. For some 5 > 0, let 
Assumptions A(2 + S), B, C and E hold, with all £ G T satisfying (36)-(38) 
and 

rank|^(ei,/i,...,e m ,/ m ,d)| =r. 
Then (15) is true. Let also d and the e%,fi be thrice continuously differen- 



tiable and Assumption A(4) hold, and do 
(17) is true. 



d(Co) > o i n case °f (35). Then 



Proof. Given (36)-(38) and the proofs of Corollaries 1 and 2, the verifi- 
cation of Assumptions D and F(Z) is straightforward. We check Assumption 
G for (35) only, a very similar type of proof holding for (34). We have 



(i) 



ij,--- 



u 



d(j + l) 



-d-l 
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where 



= (lnln(j + 1) - (8/dfi) lnr(/ l+1 ), l)'ln*(j + 1), i = 1, . . . ,r, 

m 



V4 



i=l 



and E is the diagonal matrix whose (2i — l)st diagonal element is ej, and 
whose even diagonal elements are all 1. Fixing £, we show first that the lead- 
ing (r — 1) x (r — 1) submatrix of ^(j lt ...,j r )(C) nas full rank, equivalently, 
that U m has full rank, where, for i = 1, . . . ,m, the (2i) x (2i) matrix Ui has 
(k,£)th 2x1 sub-vector Ukj e , k = l,...,i, £ = !,.. .,2i. Suppose, for some 
i = 1, . . . ,m — 1 and given ji, . . . , j2i, that C/j has full rank, and partition the 
rows and columns of C/j+i in the ratio 2i : 2, calling its (fc,£)th submatrix Um 
(so C/n = C/j). Take j2i+2 = iii+i- Because In In a; strictly increases in x > 1, it 
follows that U22 is nonsingular and HC/^il = 0(lnln j2i+i ln _ ^ +1 321+1) ■ Not- 
ing that \\U12W = 0(lnln j2i+i ln^ 1 j2i+i), while U\\ and U21 depend only on 
ji, . . . , j2i, we can choose ]2i+i such that U\\ — U^U^^ix differs negligibly 
from U\\. Thus, Ui+i has full rank. Since, for fx > 0, U\ has full rank (e.g., 
when ji = 1, J2 = 2), it follows by induction that U m has full rank. Since 
Vj is dominated by a term of order ln^ m+1 j, while = 0(lnln j hr* j), a 

similar argument shows that j r can then be chosen large enough, to complete 
verification of Assumption G. □ 

5. Technical lemmas. Define 

00 00 

af(9) =u + J2 ^j(()x 2 t-j, <yf = uj u + Y j sup^(C)a£.j. 
3=1 i=i^ T 

Lemma 1. Under Assumptions B and D, for all 6 6 0, i £ Z, 
K- 1 af{6)<al{e)<Kaf{e) a.s. 

Proof. A simple extension of [21], Lemma 1. □ 

Lemma 2. Under Assumptions A(2), B, C, D and E, for all t G Z, 

(39) £M 2p < Acr 2p < Asupa 2p (#) < KSfff'' < A'A|x t | 2p < A', 

(40) infcr 2 (#)>0, sup (8) < Kaf < 00 a.s., 

eee eeQ 

(41) A sup I In cj 2 (0)| < A. 

eee 
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Proof. With respect to (39), the first inequality follows from Jensen's 
inequality, the second is obvious, the third follows from Lemma 1, the fourth 
follows from the c r -inequality, (7) and (9), while the last one is due to (10). 
The proof of (40) uses Lemma 1, of (6>) > u L , (10) and [23], page 121. To 
prove (41), | lnx| < x + x" 1 for x > and Lemma 2 give 

£sup|lncr t 2 (#)| < p^Esupat? (9) + e{ inf a?(9)X <K. 

□ 

Lemma 3. Under Assumptions D, E and F(l), for all 9 e 6, erf (9), qt{9) 
and their first I derivatives are strictly stationary and ergodic. 

Proof. Follows straightforwardly from the assumptions. □ 

Lemma 4. Under Assumption A(2), for positive integer k < (b + l)n/2, 

/ n \ -k 

(42) H^17 <0 °' 

Proof. Denote by Mx(t) = E(e ) the moment-generating function of 
a random variable X. By Cressie et al. [8], the left-hand side of (42) is 



(43) 



proportional to 

roc roo 

J * fe ~ lM 5>?H)^ = J o t^M^i-^dt 

< / t k ~ 1 dt+ / t*- 1 Af^(-t)tft. 
Jo Ji 

It suffices to show that the last integral is bounded. For all 5 > 0, there exists 
r) > such that L(e^ 1 ) < e~ 5 , e G (0,^), so 

e~ t£2 f{e)de<K / e^V" 5 de + le^ . 

-oo JO 

The last integral is bounded by 

Kt (S-b-l)/2 r e -e £ V-b-X)/2 de < Kt {6-b-l)/2_ 



Thus, (43) is finite if k + n(5 — b — l)/2 < 0, that is, since 5 is arbitrary, if 
k<(b+l)n/2. □ 

The previous version of the paper included a longer, independently ob- 
tained, proof of the following lemma which we have been able to shorten 
in one respect by using an idea of Berkes, Horvath and Kokoszka [3] in a 
corresponding lemma covering the GARCH(n, m) case. 
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Lemma 5. Under Assumptions A(q), B, C and D, for p < q/2, 



< K < oo. 



6»G0 



Esup, 2 



Proof. We have 



of = w + ^oia^-i + ^ vpojXt-j <ujo + ipoiOt-i'H-i + - f!:c7 t 2 _ 1 

3=2 

from (8). Thus, cr|/cr 2 _ 1 < if (l + ef^) and thence, for fixed j > 1, of/of ■ < 
Khtj, where /it-,- = nj=i(l + £ 1-i)- For any M < oo, 



07 



M 9 \ - 1 



KhtM_/_ {inf C g T i'lf; : ',/ (C)} 

The proof can now be completed much as in the proof of Lemma 5.1 of [3], 
using Holder's inequality as there but employing our Lemma 4 and taking 
M >2pq/[(b+l)(q-2p)}. □ 

Lemma 6. Under Assumptions A(2), B, C, D, E and F(l), for all p> 
and k < I, 



(44) 
(45) 



E'sup 

eee 

E'sup 

eee 



1 



af (6) ae il ---de ik 

1 d k af(e) 



a 2 t {9) do h ---de, 



< oo, 



< oo. 



Proof. Take i\ < i<i < ■ ■ ■ < if.. First assume i\ > 3, whence, for given k 
and ii, . . . ,i k , 

where &(C) = ^(O/dCn-2 • ■ • 3C ife _2- Now 



so using Lemma 1, 



<2^|C i (OI(^ 2 - i + /i 2 

3=1 



3=1 



o-?{0) 
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It suffices to take p > 1. By Holder's inequality, 

oo ( oo \ p/P ( oo "1 

E I^(0l4-i < E m)\ Plp m) 1 - p/p * 2 t- j E 

j=i ij=i j ij=i j 

so 

^^Pir^ Y - K % m)\ p uo p - p \^\ 2p - 

By Assumption F(Z), for all 77 > 

sup|Ci(C)r^(C) p ~ P < if sup ^(O^ < Kj-te-W-np). 

Since p(d + 1) > 1, we may choose 77 such that (d+ l)(p — pr]) > 1. Thus, 

2 ^ <oo. 

The above proof implies that also 



whence, the proof of (44) with i\ > 3 is concluded. Next take i\ = 2. If i% > 2, 
where now = <9 fc ~Vj(0 /d& 2 -2 • • -d(i k -2, while if z 2 = 2, i 3 > 2, 

--»f<.(0. 



where now £j(C) = d k ~ 2 tpj{Q) / dC,i A -2 ■ ■ -dCi k -2- In the first of these cases 
the proof is seen to be very similar to that above after noting that, by the 
Cauchy inequality, (46) is bounded by 

{00 00 "\ 1/2 00 

E&(0k 2 -;E 1^(01 +^E I6(0i, 
3=1 3=1 j 3=1 

while in the second it is more immediate; we thus omit the details. We are 
left with the cases i\ = %i = 23 = 2 and i\ = 1, both of which are trivial. 
The details for (45) are very similar (the truncations in numerator and 
denominator match), and are thus omitted. □ 
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Define 



T 



g t {9) = u t {9)u' t {9), G T {9) = T' 1 £>(0). 



t=i 



Lemma 7. For some 5 > 0, under Assumptions A(2 + 5), B, C, D, E 
and F(l), 



(47) 



sup\Q T (0)-Q(0)\^0 a.s.asT^oo, 
flee 



and Q{9) is continuous in 9. If also Assumption F(2) holds, 



(48) 



sup||Gr(0)-G(0)||->O a.s.asT^oo, 
0eO 



and G(9) is continuous in 9. If also Assumption F(3) holds, 



(49) 

and H(9) is continuous in 



sup\\H T (9)-H(0)\\^O a.s.asT^oo, 
6>e0 



Proof. To prove (47), note first that, by Lemmas 1, 2, 3 and 5, 

supE\q (9)\ < supE\ log<7o(#)l + sup^xo^) < °o. 
e e e 



Thus, by ergodicity 



Qt{9)^Q{9) a.s., 



for all 9 E 0. Then uniform convergence follows on establishing the equicon- 
tinuity property 

sup \Qt(9) - Q T (9)\ ^ a.s., 

B: ||0-0||<£ 

as e — > 0, and continuity of Q{9). By the mean value theorem it suffices to 
show that 



sup 

e 



3Q T {9) 



06 



+ sup 




dQ(9) 



00 



< oo 



a.s., 



which, by Loeve ([23], page 121) and identity of distribution, is implied 
by i?supe ||iio(#)|| < oo. By the definition of ut{9), and xl(n) < K(xf + 1), 
\\vt(9)\\ < 2(\x t \ + 1), we have 



\Ut 



< K 



l + £ 



2 °t 



M — — + 1 
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Thus, EsupQ \\uq(9)\\ is bounded by a constant times 



£su P ||t o (0)|| + 




E sup 




On 



1" 



+ -Esup 



Of) 



o 2 WJ 

+ 1 



1/p 



EsupHro^)!!^- 1 




l-i/p 



e Uo(0). 

for all p > 1. On choosing p < 1 + 5/2, this is finite by Lemmas 5 and 6. (Our 
use of Lemmas 5 and 6 is similar to Berkes, Horvath and Kokoszka's [3] use 
of their Lemmas 5.1 and 5.2 in the GARCH(ra, m) case.) This completes the 
proof of (47). Then (48) and (49) follow by applying analogous arguments to 
those above, and so we omit the details; indeed, (48) and (49) are only used 
in the proof of consistency of Gt(0t), Ht{9t) for Go, Ho, where convergence 
over only a neighborhood of #n would suffice. □ 

Lemma 8. Under Assumptions A(2 + <5), B, C, D, E and F(l), 

(50) sup|Q T (0) -Qt(0)\-+0 a.s. asT^ oo. 

eee 

If also Assumption F(2) holds, 



(51) 



sup||G T (0)-G T (0)IHO 
eee 



a.s. as T — > oo. 



If also Assumption F(3) holds, 



(52) 



sup \\H T (9) - H T (0) || ^0 a.s. asT^oo. 

eee 



Proof. We have Q T (6) - Q T {0) = A T {9) + B T {6), where 



A T (e) = T- 1 J2^ 

Because 



t=l 



t=i 



3=0 

ln(l + x) < x for x > and of (#) > lol > 0, it follows that 

t=i 

T oo 

<KT- x Y.H^>t 3 iti 
t=i j=t 

oo ( t+T ~\ 

t=o lj=t+i ) 
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Now from (7), 

t+T t+T 

sup ^j(0<K J~ d-1 <-K'min(t + l,T)(t + l)-^- 1 . 
( eT j=t+i j=t+i 

Thus, 

T oo 

(53) sup^T(e) < KT- 1 J2(t + irHx~t + 1) + ^ E r -~\ x -t + !)■ 

6 t=0 t=T 

From the c r -inequality, (9) and (10), J2t^i(t + l)~-~ 1 x 2 Lt nas finite pth mo- 
ment, and thus, by Loeve ([23], page 121), is a.s. finite. Thus, the second 
term of (53) tends to zero a.s. as T — ► oo, while the first does so for the same 
reasons combined with the Kronecker lemma. Next, 

T oo 

\B T {6)\ < kt- 1 £ xM £ ^ (Oxl, (/*) 
t=l j=t 

(54) 

T oo 

t=i j=t 

From previous remarks, Y^jLt 3 ~ l ( x 1-j + 1) — ► a.s. Also, for each 6, a.s. 
T-± XtW ^E XaV )<K{E{£^ + l}<K 

by ergodicity and Lemma 5. Thus, (54) — > a.s. by the Toeplitz lemma. 
The convergence is uniform in 9 because, from the proof of Lemma 7, for all 
0e9, 

sup 11X0(0) - Xo(0) II ->0 a.s., 
8: \\0-6\\<e 

as e — > 0. This completes the proof of (50). We omit the proofs of (51) and 
(52) as they involve the same kind of arguments. □ 

Lemma 9. For some 5 > 0, under Assumptions A(2 + 6), B, C, D, E, 
F(l) and G, M(9) is finite and positive definite for all 9 G 0. 

Proof. Fix 9 € O. Finiteness of M(9) follows from Lemma 6. Positive 
definiteness follows (by an argument similar to that of Lumsdaine [24] in 
the GARCH(1, 1) case) if, for all nonnull (r + 2) x 1 vectors A, X'M(9)X = 
E{X't (9)} 2 > 0, that is, that 

(55) \'t (9)(t%(9) /0 a.s., 
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since < Oq{6) < oo a.s. Define 

T tu) (e) = ^\na 2 t {9) = a^(9), 

ft 00 
r t ,(6) = — lno?(0) = -2(rr 2 (0)£^(C)s*-j(M), 

r tc (0) = ^Ino?(fl) = ar 2 WEV'5 1) (04-i(M), 
^ j=i 

so that r t (0) = (r to (6»), T t/1 (9), 7^(0))'. Write A = (Ai, A 2 , A' 3 )', where Ai and 
A2 are scalar and A3 is r x 1. Consider first the case Ai = A2 = 0, A3 7^ 0. 
Suppose (55) does not hold. Then we must have 

00 

EA 3 ^ 1) (O^(m)=0 a.s. 
3=1 

If X' 3 tPi\c) + 0, it follows that 



00 

(56) (at^et-t + Mo - //) 2 = -{AWf(C)} -1 E ^fHO^-M 

3=2 

Since cr t _i > a.s., the left-hand side involves the nondegenerate random 
variable e t -i, which is independent of the right-hand side, so (56) cannot 
hold. Thus, A 3 ipj^iC) = 0- Repeated application of this argument indicates 

that, for all (, X'^tpj (C) = 0, j = 1, • • • , jr(C)- This is contradicted by As- 
sumption G, so (56) cannot hold. Next consider the case Ai = 0, A2 7^ 0, 
A3 = 0. If (56) does not hold, we must have 

00 

(57) E^-(C)^i(M) = a.s. 

i=i 

Let k be the smallest integer such that iftk(C) 0- Then (57) implies 
et-k = (Tt} k (9)<H- Ho-tp^iO E *l>jW) x t-j(v) (■ 

{ j=k+l ) 

But the left-hand side is nondegenerate and independent of the right-hand 
side, so (57) cannot hold. Next consider the case Ai = 0, A2 7^ 0, A3 7^ 0. If 
(55) is not true, then, taking A2 = 1, we must have 



(58) E^f^-iM " 2^(C)K-iGu) = 
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Let k be the smallest integer such that either \'^ k (C) 7^ or ipk{(,) 7^ 0; 
the preceding argument indicates that there exists such k. Then we have 

{2^(0 - (Oi^t-k^t-k + Mo - n)}{o t -k£t-k + Mo - m} 

oo 

= E (CK->)-2^(C)K-;M a.s. 

j=k+i 

The left-hand side is a.s. nonzero and involves the nondegenerate random 
variable £t—ki which is independent of the right-hand side, so (58) cannot 
hold. We are left with the cases where Ai 7^ 0. Taking Ai = — 1 and noting 
that a^(9)T tuJ (9) = 1, the preceding arguments indicate that there exist no 
A2 and A3 such that 

\2<J 2 t {0W(e) + X' 3 af(e)T K (9) = 1 a.s. 

□ 



Lemma 10. For some 5 > 0, under Assumptions A(2 + 5), B, C, D, E, 
F(l) and H, 

inf Q{9) > Q(9 ). 
eee 

0^00 



Proof. We have 
Q(6)-Q(6 )=E 



<7n 



In 



a\9) 



1 



The second term on the right-hand side is zero only when \jl = \xq and is 
positive otherwise. Because x — In 2; — 1 > for x > 0, with equality only 
when x = 1, it remains to show that 



(59) 



lno"o(6>) = lno"o a.s., some 9 7^ 



By the mean value theorem, (59) implies that (9 - O )V O (0) = a.s., for 
9 7^ 9q and some 9 such that \\9 — 9q\\ < \\9 — 9q\\. But by Lemma 9 there is 
no such 9. □ 
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